Classifying Emails into Human vs Machine Category
نویسندگان
چکیده
It is an essential product requirement of Yahoo Mail to distinguish between personal and machine-generated emails. The old production classifier in was based on a simple logistic regression model. That model trained by aggregating features at the SMTP address level. We propose building deep learning models message train four individual CNN models: (1) content with subject as input, (2) sender email name (3) action analyzing recipients’ patterns generating target labels senders’ opening/deleting behaviors (4) salutation utilizing "explicit salutation" signal positive labels. Next, we final full after exploring different combinations above models. Experimental results editorial data show that our improves adjusted-recall from 70.5% 78.8% precision 94.7% 96.0% compared Also, significantly outperforms state-of-the-art BERT this task. Our new has been deployed current system (Yahoo 6).
منابع مشابه
Human Races: Classifying People vs Understanding Diversity
The idea that all humans naturally belong to one of a few biological types or races that evolved in isolation was unchallenged for centuries, but large-scale modern studies failed to associate racial labels with recognizable genetic clusters. Recently, the conclusions of those studies have been questioned by authors who argue that racial classification has objective scientific bases and is indi...
متن کاملClassifying emotions in human-machine spoken dialogs
This paper reports on the comparison between various acoustic feature sets and classification algorithms for classifying spoken utterances based on the emotional state of the speaker. The data set used for the analysis comes from a corpus of human-machine dialogs obtained from a commercial application. Emotion recognition is posed as a pattern recognition problem. We used three different techni...
متن کاملClassifying Legal Questions into Topic Areas Using Machine Learning
In this paper we describe the steps taken to build a machine learning classifier that successfully classifies legal questions into the most relevant practice area. We have created 16 different general categories that legal questions fall into. Categorizing legal questions into the correct practice area has many useful applications such as facilitating improved realtime feedback, information ret...
متن کاملClassifying medical notes into standard disease codes using Machine Learning
We investigate the automatic classification of patient discharge notes into standard disease labels. We find that Convolutional Neural Networks with Attention outperform previous algorithms used in this task, and suggest further areas for improvement.
متن کاملIntegrating empty category detection into preordering Machine Translation
We propose a method for integrating Japanese empty category detection into the preordering process of Japanese-to-English statistical machine translation. First, we apply machine-learningbased empty category detection to estimate the position and the type of empty categories in the constituent tree of the source sentence. Then, we apply discriminative preordering to the augmented constituent tr...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2022
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v36i7.20666